SDSS galaxy bias from halo mass-bias relation and its cosmological implications 
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We combine the measurements of luminosity dependence of bias with the luminosity dependent 
weak lensing analysis of dark matter around galaxies to derive the galaxy bias and constrain ampli- 
tude of mass fluctuations. We take advantage of theoretical and simulation predictions that predict 
that while halo bias is rapidly increasing with mass for high masses, it is nearly constant in low mass 
halos. We use a new weak lensing analysis around the same SDSS galaxies to determine their halo 
mass probability distribution. We use these halo mass probability distributions to predict the bias 
for each luminosity subsample. Galaxies below L, are antibiased with 6 < 1 and for these galaxies 
bias is only weakly dependent on luminosity. In contrast, for galaxies above L* bias is rapidly 
increasing with luminosity. These observations are in an excellent agreement with theoretical pre- 
dictions based on weak lensing halo mass determination combined with halo bias-mass relations. 
We find that for standard cosmological parameters theoretical predictions are able to explain the 
observed luminosity dependence of bias over 6 magnitudes in absolute luminosity. We combine 
the bias constraints with those from the WMAP and the SDSS power spectrum analysis to derive 
new constraints on bias and as- For the most general parameter space that includes running and 
neutrino mass we find as = 0.88 ± 0.06 and &» = 0.99 ± 0.07. In the context of spatially flat models 
we improve the limit on the neutrino mass for the case of 3 degenerate families from m v < 0.6eV 
without bias to m„ < 0.18eV with bias (95% c.l.), which is weakened to m v < 0.24eV if running is 
allowed. The corresponding limit for 3 massless + 1 massive neutrino is 1.37eV. 



I. INTRODUCTION 

Galaxy clustering has long been recognized as a power- 
ful tool to constrain cosmology. Galaxies are assumed to 
trace dark matter on large scales and so the galaxy power 
spectrum can be related to that of the dark matter. The 
latter depends on several cosmological parameters, such 
as the epoch of matter-radiation equality, baryon to dark 
matter ratio and the primordial power spectrum shape 
and amplitude. The key assumption underlying this ap- 
proach is that galaxies trace dark matter up to an overall 
factor, called the linear bias 6, so that the galaxy and 
matter power spectra are related as Pg s (k) = b 2 Ps dnl (k), 
where S g and <5d m are the galaxy and dark matter den- 
sity fluctuations, respectively, and P(k) is their power 
spectrum. 

The linear bias assumption is thought to be accurate 
on large scales, but becomes less and less accurate on 
small scales, where details of galaxy formation play an 
important role. The exact transition scale between the 
linear and nonlinear regimes does not have to equal that 
of dark matter and may depend on the type of galaxies 
one is observing, the treatment of redshift space distor- 
tions and the cosmological model. For normal galaxies it 
is believed to be somewhere around k = 0.1 — 0.2ft,/Mpc 

HIQ. 

While the shape of the galaxy power spectrum for 
k < 0.2/i/Mpc has been used to constrain the cosmo- 



logical parameters [j, |5| , the overall amplitude is often 
ignored. The reason for this is that the galaxies can be 
biased relative to the dark matter and the bias parameter 
b depends on the galaxy properties, such as luminosity 
or type. This has long been observed as a function of 
morphological type J6| . More recent surveys emphasized 
the luminosity dependence of bias, finding that brighter 
galaxies cluster more strongly both in 2dF yj and in 
SDSS 8]. These early studies focused on the nonlinear or 
quasi-linear scales below 10ft. _1 Mpc, so it was not clear 
that their conclusions applied to the linear regime. In 
particular, on small scales the clustering strength can in- 
crease as a function of luminosity if the brighter galaxies 
preferentially populate larger halos with many galaxies 
inside, such as groups and clusters. In this case the in- 
crease in clustering amplitude on small scales is a reflec- 
tion of the enhanced correlations inside the halo and is 
not necessarily a reflection of these halos being more cor- 
related among themselves. In contrast with these previ- 
ous studies a recent analysis of SDSS galaxy survey data 
has focused the analysis on k < 0.2/i/Mpc (scales above 
10/i -1 Mpc) and thus measured the linear bias directly 9]. 
This analysis also found that bias increases as a function 
of luminosity, in a similar way as in the previous studies 
0, 8] ■ The relative amplitudes of fluctuation power spec- 
tra for different luminosity subsamples was found to differ 
by almost a factor of 2.5 from the bright to the faint end. 
It appears therefore that the evidence for linear bias in- 



creasing with luminosity has finally been established. In 
this situation it is not clear which of the galaxies are un- 
biased (6=1). In the absence of additional information 
the overall amplitude of galaxy fluctuations thus cannot 
be directly related to that of dark matter. 

There are at least three ways that have been proposed 
so far to break the degeneracy. One is to use redshift 
space distortions, which on large scales depend on the 
parameter (3 — Cl^/b, where f2 m is the matter den- 
sity of the universe. Unfortunately, even with modern 
surveys such as 2dF and SDSS, this parameter has con- 
siderable statistical uncertainty, so in itself this method 
cannot give a sufficiently precise bias determination. For 
example, the SDSS analysis gives (3 = 0.5 ± 0.2 on scales 
where nonlinear redshift space distortion modeling is re- 
liable 0. 

A second approach is to determine the bias from the 
bispectrum, as was done for 2dF galaxies [lfj. It is dif- 
ficult to measure the bispectrum on very large scales, 
where deviations from nongaussianity are small. In [lflj 
most of the weight comes from scales with 0.2/i/Mpc < 
k < 0.3/i/Mpc, in which case however it is not obvious 
that the bias measured there also applies to the larger 
scales where the power spectrum is measured, given how 
rapidly the nonlinear effects become important. In prin- 
ciple one could verify this with simulations, but pure N- 
body simulations (used so far) are not sufficient to verify 
this hypothesis, since galaxies do not trace dark matter 
on small scales and the details of how galaxies populate 
halos change the two and three point functions and de- 
pend on the specifics of the particular galaxy formation 
model. 

A third approach is to compare a weak lensing power 
spectrum determination, tracing dark matter, to the 
galaxy power spectrum or the weak lensing-galaxy cross- 
correlation. This approach can also give the bias directly, 
but is limited by the statistical power of weak lensing 
measurements. These are currently significantly more 
noisy than those of galaxies, since on large scales the 
weak lensing signal is weak and the survey areas probed 
so far are small. Current data sets have not yet reached 
the scales where linear theory is valid [Tj], E2- I n a d~ 
dition, shear calibration and background galaxy redshift 
distribution errors remain and may lead to errors as large 
as 20% on the linear amplitude [13|, Finally, so far these 
studies have averaged over galaxies covering a broad 
range of luminosity over which the bias changes signif- 
icantly, in which case comparing galaxy auto-correlation 
with galaxy-dark matter cross-correlation overestimates 
the bias even on linear scales, since (b 2 )/(b) > (b) (where 
() denotes averaging over luminosity). 

Given the difficulties of the methods described above, 
most workers adopt the conservative approach and ig- 
nore the overall amplitude of fluctuations (i.e., marginal- 
ize over a completely unconstrained bias factor when de- 
riving constraints on other parameters). Alternatively 
one can use other methods to determine the amplitude, 
such as the cluster abundance or weak lensing. These 



methods have systematics of their own, and at present 
various estimates of as vary by up to 30-40% (see |5| for 
a recent overview of current results). It would be useful 
to improve upon this situation, as the overall amplitude 
of fluctuations at zero redshift is an important source of 
information. It is especially useful to constrain the dark 
energy equation of state and neutrino mass. 

The purpose of this paper is to propose a new approach 
to determine the bias parameter and to apply it to SDSS 
data. The main element of the method is to relate the 
observed luminosity bias to ab initio predictions of the 
halo-to-mass bias relation. The relation between bias and 
halo mass is one of the fundamental predictions of large 
scale structure models [lj, [T^, [if! E3 • The bias predic- 
tions depend on the cosmological model, in particular on 
the nonlinear halo mass (defined below) where the bias is 
unity. Above this mass the bias is rapidly increasing with 
halo mass. Below this mass the bias is slowly decreasing 
to a value b ~ 0.7, i ndep endent of the other cosmological 
parameters [lj], LJJ| [J3 ■ If one determines observation- 
ally the bias as a function of halo mass one can compare 
it to theoretical predictions to establish the viability of 
this model. This is particularly simple if one observes 
directly the low mass plateau, since one can then deter- 
mine the absolute bias even without accurate halo mass 
determinations. 

If we determine the correlations of dark matter halos 
as a function of their mass we also determine the overall 
amplitude of fluctuations. However, we observe galax- 
ies, not dark matter halos, so we need to relate the two. 
While it is generally accepted that all galaxies form in 
halos we also know that the relation between the two is 
not one to one and galaxies of the same luminosity can be 
found in halos of different masses. For example, a typical 
galaxy like our Milky Way may be found at the center of 
a low mass halo with a typical size of 200kpc, it may be 
part of a small group with typical size of 500kpc or it may 
be a satellite in a cluster with a typical size of l-2Mpc. If 
we want to predict bias for a given luminosity subsample 
we must therefore determine the probability AP for a 
galaxy in this sample to be in a halo of mass M ± AM/2. 
To describe this we will use the conditional halo mass 
probability distribution dP/dM = p(M; L) (20], here- 
after GS02). It is important that the full distribution 
p(M; L) is determined and not just the mean halo mass 
at a given luminosity. This is because bias is a strong 
function of halo mass and some fraction of galaxies are 
known to be in very massive halos, which have a signifi- 
cantly larger bias than field galaxies. Even if only a small 
fraction of galaxies at a given luminosity are in clusters 
they can have a significant effect on the mean bias. 

In this work we determine the halo mass probability 
distribution for a given subsample using the weak lens- 
ing signal around these galaxies, the so called galaxy- 
galaxy (g-g) lensing. Gravitational lensing leads to tan- 
gential shear distortions of background galaxies around 
foreground galaxies [U |H |H |2l Hj| . The individual 
distortions are small, but by averaging over all galaxies 



within a given subsample we obtain high signal to noise 
in the shear as a function of angular separation from 
the galaxy. Since we know the galaxy redshift (the fore- 
ground galaxies are taken from the same spectroscopic 
sample also used to determine the galaxy power spec- 
trum), the shear signal can be related to the projected 
mass distribution as a function of proper distance from 
the galaxy 26] . This allows us to determine statistically 
the dark matter distribution around any given galaxy 
sample. With g-g lensing one can determine the full halo 
mass function, since small halos contribute only at small 
scales, while large halos such as clusters give rise to a 
signal also at larger 500-2000kpc scales typical of groups 
and clusters |2Cj . Because g-g lensing measures the sig- 
nal over a wide range of scales, this allows one to deter- 
mine the full halo mass function for a given subsample. 
In this paper we will take advantage of the latest SDSS 
data compilation based on 3800 square degrees of spec- 
troscopic data and imaging, a significant increase over 
the previous analyses of g-g lensing in SDSS |26l |27| . 

One of the key advantages of this method is that bias is 
a weak function of the halo mass, which in turn is deter- 
mined with high accuracy from the g-g lensing analysis. 
For example, the "re-Gaussianization" method of PSF 
correction used for this work methods allows a 2-3% cali- 
bration error due to the PSF dilution correction |2g|; the 
inclusion of other sources of systematic error as in [23 
raises the systematic error to roughly 10% (1-er), which 
leads roughly to a 15% error in the halo mass determi- 
nation. This in turn changes the bias only by 1% around 
6=1. The effect is even smaller for masses well below 
the nonlinear mass, where the bias approaches a constant 
independent of the halo mass. This is very different from 
the other methods of bias determination discussed above, 
where the bias is (at best) a linear function of the signal. 
For example, the same 10% weak lensing calibration error 
leads to a 10% error in bias if the large scale weak lensing- 
galaxy cross-spectrum is used as a method to determine 
the bias. The bias normalization can be predicted at any 
luminosity where the halo mass function can be deter- 
mined. Since we can determine the halo mass function 
at several luminosity bins this provides many consistency 
checks on the method and different bins can be averaged 
to reduce the statistical error on the bias. 

This method has other advantages as well. One is that 
we can use the same galaxies in the lensing analysis as 
in the galaxy clustering analysis. Using the SDSS data 
we can perform the g-g lensing analysis on the same lu- 
minosity subsamples as the ones used to obtain the large 
scale bias as a function of luminosity. There is no ambi- 
guity regarding the selection of the catalog, which often 
causes galaxies selected in different surveys to have differ- 
ent properties. Here we work exclusively with SDSS data. 
Another advantage is that the expression for large scale 
bias weights the galaxies linearly, just as does g-g lens- 
ing. This is important if the number of galaxies in a halo 
is stochastic, in which case for example a pair weighted 
statistic (such as galaxy clustering on small scales) differs 



from a linearly weighted statistic. 

G-g lensing is one of the most direct and model in- 
dependent methods to determine the halo probability 
distribution, short of observing the dark matter halos 
directly. In particular, this method avoids the prob- 
lems with optical (or X-ray, SZ etc.) identification of 
halos, which is only reliable for large halos such as 
clusters. Most of the galaxies are in low mass halos 
(10 11 Mq — 10 13 Mq), so it is important that the mass 
probability distribution is reliable in that range. Another 
approach to parametrize the halo occupation distribution 
is with a conditional luminosity function 30]. This was 
used to determine the bias in 2dF galaxies |3lJ , but in the 
absence of lensing information a more model dependent 
analysis had to be used. Initial results seemed to give 
higher bias than the bispectrum analysis of bias in 2dF 
[l(j , but this may be a consequence of using theoretical 
predictions from [lq. which overestimate bias by up to 
20% compared to simulations |19| . 

The remainder of the paper is organized as follows. In 
§2 we present an overview of the theory, first discussing 
the theoretical predictions for halo bias and then the re- 
lation between halos and galaxies within the context of 
halo models. The analysis of the galaxy clustering data, 
weak lensing and bias is presented in §3. The cosmo- 
logical implications of the results are presented in §4, 
followed by conclusions in §5. 



II. OVERVIEW OF THE THEORY 

In this section we first review the concept of halo bias 
and then the formalism which relates galaxies to dark 
matter halos. We also discuss weak lensing as a method 
to connect the two. 



A. Halo bias 

In current cosmological models structure grows hierar- 
chically from small, initially Gaussian fluctuations. Once 
the fluctuations go nonlinear they collapse into virializcd 
halos. The spatial density of halos as a function of their 
mass M is specified by the halo mass function dn/dM, 
which in general is a function of redshift z. It can be 
written as 



^L dM = 4-Jiy)dv, 

dM M JK ' ' 



(1) 



where p is the mean matter density of the universe. We 
introduced the function f(u), which can be expressed 
in units in which it has a theoretically universal form 
independent of the power spectrum or redshift if written 
as a function of peak height 



v = [8 c (z)/a(M)f 



(2) 



Here S c is the linear overdensity at which a spherical per- 
turbation collapses at z (S c — 1.68 for the spherical col- 
lapse model) and a(M) is the rms fluctuation in spheres 



that contain on average mass M at an initial time, ex- 
trapolated using linear theory to z. 

The first a naly tic model for the mass function has been 
proposed by [32]. While it correctly predicts the abun- 
dance of massive halos, it overpredicts the abundance of 
halos around the nonlinear mass scale M n \ (defined be- 
low). An improved version has been proposed by [16| . It 
has been shown that it can be derived analytically within 
the framework of the ellipsoidal collapse model [13 ■ The 
halo mass is defined in terms of the linking length pa- 
rameter of the friends-of-friends (FoF) algorithm, which 
is 0.2 for the simulations used in [l6(- This roughly cor- 
responds to spherical overdensity halos of 180 times the 
background density [22j • For the range of masses of in- 
terest here it is 30% larger than the mass defined as the 
mass within the radius where the density is 200 times 
the critical density J3J, [35|. To be specific, we will use 
Q m = 0.3 when computing the virial masses, so they are 
defined as the mass within the radius within which the 
mean density is 54 times the critical density. 

Just like the underlying dark matter the halos are cor- 
related among themselves. The correlation amplitude 
depends on the halo mass, with more massive halos be- 
ing more strongly clustered. This is called halo bias and 
can be easily understood within the the peak-background 
split of the spherical collapse model [Til Il5| : an underly- 
ing long wavelength density perturbation contributes to 
the threshold collapse value 5 C = 1.68, leading to a larger 
number of halos collapsing in a local overdensity of the 
background relative to an underdensity. The more mas- 
sive halos are more strongly clustered, with the strength 
related to the derivative of the mass function. 

What does the halo bias depend on? While the theo- 
retical predictions of [n| depend on the amplitude and 
shape of the power spectrum and the density parame- 
ter VL m , for the relevant models most of the dependence 
can be expressed in terms of the nonlinear mass M n \, de- 
fined as the mass enclosed in a sphere of radius within 
which the rms fluctuation amplitude is 1.68. In [l9j it 
was shown that a good fit to the bias relation from sim- 
ulations is given by 



b(x = M/M nX ) = 0.53 + 0.39s 
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Here VL m is matter density, ug is matter amplitude of 
fluctuations in spheres of 8/i~ 1 Mpc, h is the Hubble pa- 
rameter in units of lOOkm/s/Mpc, n s is the scalar slope 
at k = 0.05/Mpc and a s = dn s /dhik is the running of 
the slope, which we approximate as constant. This ex- 
pression should be accurate to about 0.03 over the range 
0.01 < x < 10. It improves upon previous fits to simu- 
lations [la. [lg , in particularly in the regime below the 
nonlinear mass, where previous expressions overestimate 
the bias by as much as 20%. It is clear that the dom- 
inant parameter for bias determination is halo mass in 



units of nonlinear mass, while the variations of cosmo- 
logical parameters produce only small deviations from 
the universality of this expression. 



B. Halo-galaxy connection 

In all of the current models of structure formation 
galaxies form inside dark matter halos. A galaxy of a 
given luminosity L can form in halos of different mass 
M. This is described by the conditional halo mass prob- 
ability distribution at a given luminosity p(M;L), nor- 
malized to unity when integrated over mass. The linear 
bias on large scales at a given luminosity is given by 



b(L) = / p(M; L)b{M)dM. 



(4) 



A given cosmological model determines 6(M); to deter- 
mine b(L) we therefore need p(M; L). 

As mentioned in the introduction the most direct 
route to the conditional mass probability distribution 
p(M; L) is via g-g lensing. This measures the tangen- 
tial shear distortions in the shapes of background galax- 
ies induced by the mass distribution around foreground 
galaxies. The shear distortions jt are very small, in 
our case 10~ 3 , while the typical galaxy shape noise is 
0.3. To extract the signal we must average over many 
foreground-background pairs. This results in a measure- 
ment of the shear-galaxy cross-correlation as a function 
of their relative separation on the sky. If the redshift 
of the foreground galaxy is known then one can express 
the relative separation in terms of transverse physical 
scale R. If, in addition, the redshift distribution of 
the background galaxies, or their actual redshifts, are 
known, then one can relate the shear distortion jt to 
AE(i?) = E(R) - T,(R), where E(iJ) is the_ surface mass 
density at the transverse separation R and E(i?) its mean 
within R, via 
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(5) 



(6) 



where vl and rs are the comoving distances to the lens 
and source, respectively, and r^s is the comoving dis- 
tance between the two (we work with comoving units 
throughout the paper) . If only the probability distribu- 
tion for source redshifts is known then this expression 
needs to be integrated over it. In principle the rela- 
tion between angular diameter distance and measured 
redshift depends on cosmology, but since we are dealing 
with low redshift objects varying cosmology within the 
allowed range makes little difference. We will assume a 
cosmology with tt m = 0.3 and Q\ = 0.7. 



We will now overview the formalism of GS02, begin- 
ning the discussion with a simplified description. Let us 
assume that a given halo of mass M produces an average 
lensing profile AE(i?, M). This can be obtained from 
a line of sight integration over the dark matter profile, 
which in this paper is modeled as an NFW profile J2{j 



tight and we model this component with a delta-function, 



p{r) 



(r/r s )(l+r/r s y 



(7) 



This model assumes that the profile shape is universal in 
units of scale radius r s , while its characteristic density 
p s at r s or concentration Cd m = r v /r s may depend on 
the halo mass, which here will be modelled as Cd m = 
10(M/Af nl )-° 13 miH. We will define the virial radius 
as the radius within which the density is 180 times the 
mean density of the universe. Note that this definition 
depends on O m : we will adopt VL m — 0.3. Since most of 
the signal is at R > 50 — 100/i _1 kpc, baryonic effects can 
be neglected, dark matter profiles are well determined 
from simulations and concentration or the choice of the 
halo profile does not play a major role. The average g-g 
lensing signal for a galaxy with luminosity L is 



(AE)(R;L) = J p(M;L)AZ(R,M)dM. 



(8) 



From above we see that the same conditional mass prob- 
ability distribution p(M; L) enters in both the lensing 
signal and in the expression for bias. One measures the 
function (AE)(i?;L); since the profile for individual ha- 
los is known one can invert the relation in equation [H] to 
obtain p(M;L). 

Given the noisy measurements of the g-g lensing signal 
we cannot invert the conditional mass probability distri- 
bution with arbitrary precision, so we must assume some 
functional form for it and then fit for its parameters. 
We wish to model the probability distribution p(M; L) 
in as model independent way as possible. We will begin 
with the simplest physically motivated model and then 
add more parameters to see how the results change. By 
relaxing the assumed functional form we can test the ro- 
bustness of the final results on the model assumptions. 

The simplified description so far ignores the fact that 
there are two distinct galaxy types that need to be mod- 
eled separately. The first type are the galaxies that 
formed at the centers of dark matter halos, such as the 
so called field galaxies or CDs sitting at the cluster cen- 
ters. The second type are the non-central galaxies, such 
as satellites of Milky Way type halos or group and cluster 
members. We know that a galaxy of a given luminosity 
can be of either type, so we split p(M; L) into two parts, 
p c and p NC , representing respectively central and non- 
central galaxies, with the fraction of non-central galaxies 
in each luminosity bin Li given by a free parameter on, 



p(M;Li) = (1 - a,) p G {M-L. i ) + a l p NC (M;L,) . (9) 

For the central galaxy population we assume that the 
relation between the halo mass and galaxy luminosity is 



p G (M;L t )dM = S D (M 



M 0:i )dM 



(10) 



where Mo,j are 6 more free parameters (we will be work- 
ing with 6 luminosity bins). In reality this component 
should have some width both because of intrinsic scatter 
in the M — L relation and because we work with luminos- 
ity bins of finite width. We will ignore this, since explicit 
tests have shown that the results are only weakly affected 
even if the scatter is more than a factor of two in mass 
[20j. Instead, we will use simulations to account for any 
such effects. 

The non-central galaxies are different in that they have 
presumably formed in smaller halos which then merged 
into larger ones. It is thus reasonable to assume that 
their luminosity is not related to the final halo mass. In- 
stead we assume a relation between the number of these 
non-central galaxies and the halo mass: the larger the 
halo the more satellites of a given luminosity one ex- 
pects to find in it. We assume this relation is a power 
law, (N)(M;L) oc M e , above some minimal halo mass 
-Mmin, which should be larger than the halo mass of the 
central galaxy component above, since we are assuming 
that there is already another galaxy at the halo center. 
Below this cutoff the number of galaxies quickly goes to 
zero. These assumptions imply 



„NC( 



dn 



p—(M;Li)dM ex F(M)M e '—dM . 



(11) 



In GS02 we have chosen F(M) = Q H (M~M mhhi ) where 
-^min,i = 3Mo.i, while here we will use a slightly more 



realistic functional form where e — 2 below M„ 



We 



have verified that the two expressions do not differ sig- 
nificantly in final results. Semi-analytic models of gala xy 
formation [2(J,|3ij, subhalos in N-body simulations |4J| as 
well as explicit comparisons with simulations J41J agree 
with this model and predict that for most galaxies e ~ 1 
and a ~ 0.2. 

For the non-central component the weak lensing pro- 
file AS(i?, M) is a convolution of the halo profile with 
the radial distribution of the galaxies, which we assume 
to be proportional to the dark matter profile, c g = acd m - 
Observationally there is not much evidence for any de- 
partures from a — 1 and we can test it using lensing data 
itself. Since we are explicitly excluding the central galax- 
ies the non-central galaxy component of the g-g lensing 
signal does not peak at the center. Instead, for a given 
halo mass, it is small at small radii, peaks at a fraction 
of virial radius and then drops off at large radii. A given 
halo mass peaks at a given scale, so by measuring the 
signal over a broad range of scales one can extract the 
relative contributions of different halo masses (note that 
the signal of lensing from neighboring halos can be ne- 
glected for R < 2Mpc). See GS02 for a more detailed 
discussion of the predictions of our model for lensing. 

The remaining uncertainty is how much of the dark 
matter around non-central galaxies remains attached to 



them. Since their fraction a is typically low (a < 0.2) the 
correction due to this is small and is limited to the inner 
region with R < 200/i _1 kpc. We assume the dark matter 
was tidally stripped in the outer parts of the halo, but re- 
mains unmodified in the inner parts of the satellite halo. 
Effectively this means that each non-central galaxy also 
has a central contribution, which we model in the same 
way as for the central galaxies (i.e., as a halo with mass 
M 0}i before stripping) out to 0.4r v ; r and totally stripped 
beyond that, in which case AS oc Br 2 . This cutoff is 
equivalent to having 50% of mass stripped and agrees 
with simulation results in next section. 

With this parametrization the mean bias in a given 
luminosity bin is 



b{Li) = (1 - Oi)6(M ,i) + a.. 
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( 12 ) 
In the simplest form the parametrization for the condi- 
tional halo distribution function only has two parameters 
at each luminosity, Mo,i and a.%, with the other param- 
eters fixed to their expected values. Since we will be 
working with 6 luminosity bins this implies a 12 param- 
eter parametrization, which is already significantly more 
than in previous analysis of this type J26J. Even with 
the order of magnitude increase in the data size not all 
of them can be determined with high statistical signifi- 
cance. We will begin with these 12 parameter fits and 
then allow ej and c g to vary to see its effect on the final 
result. Our goal is to make the model description as non- 
parametric as possible and to show that our conclusions 
are robust against different parameterizations. 

The halo model of GS02 is phenomenological and needs 
to be verified and possibly calibrated on simulations. A 
detailed comparison has been presented elsewhere [41j : 
here we simply highlight the results that are of most rel- 
evance for the present study. Overall, the halo model is 
able to extract the relevant information from the simula- 
tions remarkably well. We find that the satellite fraction 
is determined to better than 10%. The simulations repro- 
duce well the N(M) parametrization and indicate e = 1. 
We also find that if simulations have little or no scatter 
in the mass luminosity relation then the halo model is 
able to extract the halo mass to within 10%. However, if 
there is a significant scatter then our assumption that the 
central mass distribution is a delta function breaks down 
and there is no unique definition of halo mass. In the case 
of a severe scatter there may be a significant difference 
between median and mean mass and weak lensing anal- 
ysis determines something in between the two J42J. For 
the purpose of bias we wish to determine the mean mass 
or, even better, the full halo mass distribution. Thus the 
halo mass determined by the lensing analysis has to be 
increased. Here we apply corrections as derived in |41| by 
direct comparison to numerical simulations. At the faint 
end these corrections are small, while for the brightest 
bin we apply up to a 50% increase. These corrections 
are somewhat uncertain since we do not know the exact 



amount of scatter, so we also add a gaussian scatter with 
rms 0.5 of the correction factor to the masses from the 
bootstrap resamplings to account for the additional un- 
certainty due to the scatter in mass luminosity relation. 
We emphasize again that even a 50% correction in mass 
has only a small effect on halo bias below the nonlinear 
mass. 



III. DATA ANALYSIS 

We wish to determine the lensing-constrained predic- 
tion for bias as a function of luminosity and compare it to 
the observations. In this section we present the required 
procedure to achieve this goal. This involves four steps: 

1. Determine the galaxy bias for each luminosity bin 
from the galaxy clustering analysis. This step was al- 
ready done in [j|. 

2. Determine the conditional halo mass probability 
distribution from the weak lensing analysis for the same 
luminosity bins (i.e., determine the allowed values for <Xi, 
Mo,i, and possibly e.;). 

3. Compute the predicted biases and their associated 
errors separately for each luminosity bin by varying over 
all possible configurations of the conditional halo mass 
probability distribution consistent with the data. 

4. Compare the observed bias and the predictions to 
place constraints on the cosmological model. 



A. Galaxy clustering analysis 

The Sloan Digital Sky Survey [43| uses a drift-scanning 
imaging camera [44j and a 640 fiber double spectrograph 
on a dedicated 2.5m telescope. It is an ongoing survey 
to image 10,000 sq. deg. of the sky in the SDSS ugriz 
AB magnitude system J45|, |46| and to obtain spectra for 
~ 10 6 galaxies and ~ 10 5 quasars. The astrometric cal- 
ibration is good to better than O'.'l rms per coordinate 
|47|. and the pho tometric calibration is accurate to 3% 
or better [48L |49| . The data sample used for the clus- 
tering analysis was compiled in Summer 2002 and is all 
part of data releases two [5(J]. This sample consists of 
205,443 galaxies. For our purpose the data are divided 
into 6 luminosity bins specified in table 1. 

Galaxies are selected for spectroscopic observations us- 
ing the algorithm described in [5l|. To a good approx- 
imation, the main galaxy sample consists of all galaxies 
with r-band apparent Petrosian magnitude r < 17.77. 
These targets are assigned to spectroscopic plates by an 
adaptive tiling algorithm [53. The spectroscopic data 
reduction and redshift determination are performed by 
automated pipelines. 

The power spectrum analysis is described in detail in 
yj. It involves several steps. In the first step the group 
finding algorithm identifies all groups and clusters, which 
are then isotropized to remove finger of god effects. In 
the next step linear decomposition into 4000 KL modes 



Table 1 - The tabic summarizes the luminosity subsamples used in our analysis, listing evolution and k corrected absolute magnitude 

Mo.l,, (for h = 1), mean redshift z in the lensing sample, number of foreground galaxies used in lensing analysis, observed bias relative to 

Mo.v,, = —20.8 and its error, theoretically predicted bias for 2-parameter models &2p and its error a^ 2 and theoretically predicted bias 

for 3-parameter models b^ p and its error <j(, 3 . Both fits are for a model with Mq = 5.6 X 10 12 /i — Mq, which corresponds to erg = 0.9, 

f2m = 0.27 model. For other values see figure[2] Mo.tr was computed from magnitude r and redshift z assuming a flat cosmological 

model with f^A = 0.7. Apparent magnitude cuts are 14.5 < r < 17.77. 



Sample name 



Abs. mag 



Mean redshift # of galaxies b/fc„ cr b i bt &2p Cb 2 ^3p a b- 



3E_ 



LI 


-18 < Mo.tr < ■ 


-17 


0.023 


4,912 


L2 


-19 < Mo.tr < " 


-18 


0.035 


15,920 


L3 


-20 < Mo.tr < ' 


-19 


0.072 


49,505 


L4 


-21 < Mo.tr < - 


-20 


0.107 


88,405 


L5 


-22 < Mo.tr < - 


-21 


0.151 


55,440 


L6 


-23 < Mo.tr < - 


-22 


0.205 


6,000 



0.723 0.073 0.67 0.04 0.67 0.04 

0.764 0.123 0.77 0.05 0.77 0.05 

0.873 0.077 0.82 0.03 0.83 0.03 

0.969 0.054 0.85 0.03 0.85 0.03 

1.106 0.063 1.04 0.05 1.05 0.05 

1.631 0.119 1.94 0.20 1.92 0.25 



is performed, which maximize the signal on large scales. 
These are then used in the quadratic estimation of the 
power spectra. The redshift space distortions (modeled 
using linear theory) are analyzed in terms of their velocity 
power spectrum, which is estimated together with the 
galaxy power spectrum and the cross-correlation between 
the two. This analysis is performed for each of the 6 
luminosity bins. 

The result are 6 power spectra with similar shapes, but 
offset amplitudes. To quantify this similarity of shapes, 
one fits each of the measured power spectra to the refer- 
ence ACDM curve with the amplitude freely adjustable. 
All six cases produce acceptable fits with reduced x 2 of 
order unity, and the corresponding best-fit normaliza- 
tions and associated errors are given in table 1 and shown 
in figure They are normalized relative to L* galaxies 
with Mo.tr — —20.8 and are expressed in terms of linear 
amplitude of fluctuations as in figure [3] 



B. Weak lensing analysis 



In this section we briefly review the weak lensing analy- 
sis. More details are given in a separate publication, |29| . 
The basic model has already been outlined above. For 
each luminosity bin we parametrize the conditional halo 
mass probability distribution p(M; L) with a few free pa- 
rameters that we determine from the observed galaxy- 
lensing correlation function. We compute this correlation 
function in several radial bins and use random sample 
catalogs and bootstrap resampling of real galaxies to de- 
termine the covariance matrix of these bins. We use the 
same 6 luminosity bins as in the clustering analysis. We 
use the SDSS sample compiled in Summer 2003 for this 
analysis, which consists of 279,616 galaxies, somewhat 
larger than what was used in the clustering analysis, but 
with identical selection criteria. 

For the background galaxies we use two samples for 
which the shape information has been extracted from the 
images using the re-Gaussianization method of 28], with 
the implementation described in |29j. We require this 



shape information to be available in at least two colors. 
In the first sample are those with assigned photomet- 
ric redshifts, obtained by KPHOTOZ v3.2 J53, which are 
used for brighter galaxies with r < 21. In second sam- 
ple are the fainter galaxies with 21 < r < 22, for which 
only the expected redshift distribution is known. Details 
of the photometric redshift error distributions for the 
brighter source sample and the redshift distributions for 
the fainter sample are given in [23. The typical redshift 
distribution of the background sample is 0.1 < z < 0.6. 

We use a minimization routine to determine the model 
parameters. A detailed description of the model and its 
reliability when applied to simulated data is presented 
in 41], here we just show the results in figure 2] for the 
two parameter fits, virial halo mass satellite fraction a. 
Since we only estimate two parameters at a time the min- 
imization always converges to the global minimum. We 
see that the model is an adequate description of the data, 
which is confirmed by the x 2 values (for LI to L6, the x 2 
values are 47, 74, 36, 36, 51, and 44 respectively for 44 
degrees of freedom) . Note that for all but the faintest bin 
there is a clear detection of the signal and both the cen- 
tral and noncentral components are determined. In some 
of the brightest bins the signal to noise in these detections 
is enormous compared to previous analyses of this sort 
(26j. The virial mass of the central component scales 
nearly linearly with luminosity in all but the brightest 
bin (there is no detection in the faintest bin), while the 
non-central fraction is roughly constant at a — 0.13 ex- 
cept in the faintest bin where it is much lower and in 
the brightest bin where it is much higher. These are 
the values assuming M u \ = 8x 10 12 Mq. This value in- 
creases as the nonlinear mass decreases, since the mass 
function is exponentially cut-off above nonlinear mass, so 
the abundance of high mass halos is reduced and to fit 
the observed signal the fraction a must increase. We do 
the fits on a grid of values for M n \ that spans the range of 
interest. In the faintest bin no detection of the noncen- 
tral component is obtained and the central component 
is marginal. However, the resulting upper limits are still 
useful, since they imply that a majority of these galaxies 
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cannot live in massive halos, otherwise we would have 
detected a stronger lensing signal. For low mass halos 
the bias is only weakly dependent on mass, so even an 
upper limit leads to a strong constraint on the bias. In 
fact, if the low mass plateau where b ~ 0.65 — 0.7 could 
be reached then one could determine the absolute bias 
directly just from this low luminosity population with- 
out any additional modeling. In practice whether or not 
this low mass limit is reached with the range of halo 
masses we can probe here depends on the value of non- 
linear mass, but the fact that the bias is flattening at the 
faint end does place useful constraints when compared to 
observations, where the same trend is observed. 

We repeat the analysis by adding more parameters 
to the fit. The most relevant parameter for bias is e, 
which changes the relative proportion of less massive 
versus more massive halos for the non-central compo- 
nent. We find that we cannot determine all 3 parameters 
separately at each luminosity bin, so that e is strongly 
anti-correlated with a. For example, we find that even 
solutions with e = 2 are allowed, suggesting a large frac- 
tion of massive halos. However, at the same time a is 
reduced, so that the overall fraction of massive halos is 
more or less unchanged. This is exactly what one would 
expect, since the signal at R = 300 — 2000/i _1 kpc is mea- 
suring directly the contribution from these massive halos 
(groups and clusters). In other words, the 3-parameter fit 
is essentially a non-parametric fit to the data with large 
degeneracies between the parameters, but little variation 
in the halo mass probability distribution in the relevant 
range. 

We also tried a different nonparametric fit, where we 
divide the halo masses into several bins and fit for the 
fraction in each separately. Not surprisingly we find huge 
degeneracies between individual components in this case, 
especially at the low mass end. Even in this most gen- 
eral case the fraction of galaxies residing in large halos 
(above 10 1 Ii~ 1 Mq) is constrained to be below 0.2 at the 
bright end and below 0.01 at the faint end. It is particu- 
larly important that the fraction of galaxies in these high 
mass halos is strongly constrained, as these halos have 
the largest bias and a poor constraint on them leads to 
a larger error on the bias predictions. The variations of 
bias consistent with these various halo mass probability 
distributions are in fact very small. 

While for standard analysis we assumed that radial 
distribution of galaxies is the same as that of dark matter, 
we also explored other possibilities. Weak lensing data 
have some sensitivity to determine the radial distribution 
directly, since a shallow galaxy distribution also leads to 
a shallow radial dependence of AE |4l|. However, this 
only makes a difference at large radii and the central mass 
determination only weakly depends on it. We find that 
the central halo mass changes by less than 10% if c g is 
varied by a factor of 2 from its assumed value Cd m - 

One possible source of systematical error is the shear 
calibration from measured ellipticities. For this analysis 
using re-Gaussianization, as shown in [29j, when we in- 



clude all sources of shear calibration error (not just the 
2-3% PSF dilution correction), we are left with a roughtly 
10% shear calibration error (2<r). 

The redshifts of background galaxies are another pos- 
sible source of systematic error. For those galaxies that 
have photometric redshifts the main difficulty is knowing 
the error distribution, since even a relatively minor frac- 
tion of outliers can skew the distribution and lead to a 
bias in the lensing signal. This is particularly problematic 
for the brighter galaxies (r < 21) for which the typical 
redshift is 0.2-0.4. We use direct matching of SDSS ob- 
jects with deeper spectroscopic surveys (DEEP2) to cal- 
ibrate our photometric redshifts. An independent anal- 
ysis using Luminous Red Galaxies (LRG) for which we 
know the redshift error distribution, provides an indepen- 
dent confirmation of our photoz calibration. Details of 
the LRG redshift distributions are given in J54J, and the 
comparison with the other methods used for this paper 
is in |2<j. 

The redshift distributions as a function of magnitude 
are relatively well measured at these magnitudes, so one 
could use those instead of photometric redshifts. How- 
ever, the existing redshift distributions that apply to the 
overall population of galaxies cannot be directly used in 
the lensing analysis, especially not at the faint end. This 
is because a significant fraction of galaxies are rejected 
or downweighted in the lensing analysis because they are 
too small to give reliable ellipticity measurements. These 
galaxies tend to be smaller and thus at a higher redshift 
relative to the overall population. So the effective red- 
shift of the "lensing weighted" population tends to be 
lower than that of the overall population at the same 
magnitude. The same effect may also lead to an under- 
estimate of as in weak lensing measurements of the power 
spectrum, where it can have much more damaging con- 
sequences. It is very difficult to account for this effect if 
one does not have the complete redshift information of a 
representative portion of the data. 

Besides the shear calibration and redshift distribution, 
there are several other sources of systematic error in the 
weak lensing signal: magnification bias, stellar contami- 
nation, intrinsic alignments, sky level determination er- 
rors. We use estimates of the values of these errors for our 
analysis from |29f. and place a 10% 1 — a overall calibra- 
tion error on the lensing signal on top of the (comparable) 
statistical errors. 



C. Bias predictions 

In the next step we take the results from the previous 
section to compute the predicted bias as a function of 
luminosity. Bias is a nonlinear function of the model pa- 
rameters and we wish to determine both the mean and 
the variance. The fits for the halo mass probability dis- 
tribution can be strongly degenerate, so a Gaussian ap- 
proximation for the fitted parameters is not necessarily 
valid. In addition, one must impose physical constraints 
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FIG. 1: Weak lensing signal AE(r) as a function of transverse separation r as measured from SDSS data, together with the 
best fit 2-parameter model for each of 6 luminosity bins. Also shown are the best fit values for halo virial mass M (in units of 
10 11 /i _1 Mq) and a, the fraction of galaxies that are non-central, assuming M n \ = 8x 10 12 Mq. The model fits the data well 
in all bins. The mass fits are what comes from the fitting procedure and are a typical halo mass somewhere between mean 
and median. For bias calculations they are increased by varying amounts to account for the difference between fitted mass and 
mean mass, as described in the text. 



such as a > and Mo > 0. Our approach is to use 
bootstrap resampling to determine the errors. We divide 
the observed area into 200 chunks of roughly equal area. 
We then bootstrap resample these, by randomly choosing 
200 chunks with replacement (so some of these are du- 
plicated) . We perform the fitting procedure as described 



above on each of the bootstrap realizations. We use the 
fitted parameters to compute the bias using eciuation ll2l 
Finally, we compute the mean and variance of the bias 
parameter by averaging over 2500 of these bootstrap re- 
samplings. 

Before discussing the results we need to address the 
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redshift evolution. Redshift evolution can affect the weak 
lcnsing analysis, the bias analysis and the clustering anal- 
ysis. Most of the galaxies are at low redshift up to z ~ 0.2 
(table 1), so any corrections due to the redshift evolution 
are small. Moreover, there are various competing effects 
that further suppress the redshift evolution effects. 

We work in comoving coordinates, so redshift effects 
are minor. One usually applies the evolution correc- 
tion to the galaxy luminosity to account for the fact 
that higher redshift galaxies are brighter because their 
stars are younger. We use the correction as given in |55j . 
adding the quantity 1.6(2 — 0.1) to the r-band absolute 
magnitudes. This correction is not important as long 
as we use the same definition of luminosity in our weak 
lensing analysis as is in the clustering analysis. The def- 
inition of the virial mass as defined by FOF algorithm of 
simulations is in comoving coordinates: FOF algorithm 
finds clusters with a density contrast of order 6~ 3 rela- 
tive to the mean, where 6 is the linking length of FOF. 
This definition does not vary with redshift in comoving 
coordinates. The main effect of redshift evolution is that 
nonlinear mass at higher redshifts is lower, which affects 
the mass function and bias predictions. We perform the 
analysis on a grid of nonlinear masses defined at z=0 and 
for each use the nonlinear mass value appropriate for the 
median redshift for a given luminosity bin. Regarding the 
clustering evolution with redshift, for L» galaxies a typi- 
cal redshift is 0.1, so once we find bias for these galaxies 
and multiply it with the observed amplitude of galaxy 
clustering we need to evolve this amplitude of fluctua- 
tions to redshift 0, which increases it by roughly 5% in a 
ACDM universe. 

The results for 2-parameter models are shown in fig- 
ure |2 for several values of the nonlinear mass M n \, span- 
ning the range from 1.6 x 10 11 H~ 1 Mq (corresponding to 
ct 8 = 0.6, Q m = 0.2 model) at the top to 2.5 x 10 13 /i _1 Mq 
(corresponding to cr§ = 1.1, fl m — 0.3 model) at the bot- 
tom (we vary as from 1.1 to 0.6 from bottom up assum- 
ing fl m = 0.3, the top one has as = 0.6 and fl m — 0.25). 
Note that the weak lensing determination of a depends 
on M n \, since the cluster contribution depends on the 
cluster mass function: for higher value of nonlinear mass 
the exponential cutoff in the mass function is at a higher 
mass and so the fraction of galaxies in this component 
can be lower to match the observational constraints. We 
include this by performing the lensing analysis on all the 
values of M n \ of interest and use that information when 
computing bias predictions as a function of M n \ . 

One can see that as a consequence of the weak lensing 
determination of the halo mass distribution models with 
low nonlinear masses (low as and/or f2 m ) predict higher 
bias than those with higher nonlinear masses. Notice how 
the models with high nonlinear mass predict almost con- 
stant bias with luminosity, a consequence of bias being 
independent of mass below 0.1M n i. On the other hand, 
if nonlinear mass is close to the halo mass of L* galaxies 
then the bias is rapidly changing with luminosity. 

We can address the robustness of the bias predictions 




FIG. 2: This figure shows the lensing-constrained model pre- 
dictions for bias as a function of nonlinear mass using the 
2-parameter models of the halo mass probability distribu- 
tion. More general models of the halo probability distribu- 
tion give very similar results and are not shown here. The 
nonlinear masses from top to bottom are 3.4 x 10 h~ Mq, 
6.2 x 10 11 /i~ 1 M Q , 1.7 x 1O 12 6 _1 M , 4.0 x lO 12 ft _1 M0, 
8.0 x lO 12 /i _1 M , 1.5 x 1O 13 6 _1 M and 2.4 x lO 13 /i _1 M . 
Errors have been suppressed (see Table 1). 



by comparing the results between 2 and 3-parameter 
models. These are shown in table 1 for nonlinear mass 
M nl = 5.6 x 10 12 /i _1 Mq, corresponding to the a 8 = 0.9, 
Q m = 0.27 model. 3-parameter models are very degen- 
erate and often give unlikely values such as e = (corre- 
sponding to the case where the number of galaxies within 
a halo is constant regardless of halo mass). This is com- 
pensated by increasing the value of a so that the data 
are still fit well. Remarkably, the mean bias and its error 
hardly change from 2-parameter models to 3-parameter 
models in all bins. This demonstrates that the bias pre- 
dictions are robust against the parametrization of the 
halo probability distribution. 

Figure shows the predicted values for galaxy cluster- 
ing amplitude eg for models with a s varying from 0.6 to 
1.1, with as — 0.88 in bold. This is obtained by taking 
predicted bias values and multiply them with ag. Also 
shown is the observed values of as which are obtained by 
taking 6/6* values in table 1 and multiplying them with 
as{Mo.i r = —20.8) = 0.875 value as obtained from |j|. 
The latter is almost independent on cosmological param- 
eters and has a statistical error of 0.013. The first thing 
to notice is the agreement between the theoretical predic- 
tions and the observations, with theoretical predictions 
being slightly lower than observations at L4. Theoreti- 
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FIG. 3: The halo bias predictions of galaxy fluctuation am- 
plitude <rf a as a function of luminosity varying linear matter 
amplitude as: 0.6, 0.7, 0.8, 0.9, 1.0, 1.1, from top to bottom 
on the right hand side. The remaining parameters have been 
fixed to ilm =0.3 and n s — 1. Squares are for model with 
cts = 0.88, n s = 1.0 and Q. m — 0.3. For this model we show 
errors from theoretical modelling. Also shown as triangles are 
the observed values of <r| a . 



cal models predict both the gradual flattening of bias for 
galaxies fainter than L4, as well as a rapid increase in bias 
from L5 to L6, both of which are observed in SDSS data. 
This rapid increase in bias from L5 to L6 is caused both 
by the rapid increase in bias above the nonlinear mass as 
well as by the rapid decrease in star formation efficiency 
for the most massive halos: going from L5 to L6 we in- 
crease the luminosity by a factor of 2.5, while the halo 
mass has increased by a factor of 6.5 (figure ^), and the 
fraction of non-central galaxies has also increased. So 
the scaling between halo mass and luminosity becomes 
much steeper at the bright end and most of the galaxies 
in the [—22, —23] bin reside in group and cluster halos 



with masses above 10 13 /i 1 Mq. 

This agreement between the theoretical bias predic- 
tions and observations suggests that a fundamental pre- 
diction of large scale structure models, that of the bias 
dependence on halo mass, has been confirmed. Very low 
values of M n \ predict that bias is rapidly changing with 
mass over the observed range, which is not observed. 
On the other side of the mass range, very high values 
of M n \ predict bias is changing very slowly with mass, 
which would also be in contradiction with the observa- 
tions. These results constrain as, but these constraints 
by themselves are not very strong. In fact, one can find 
good agreement for er 8 ~ 0.7 — 1.0. 



At first this agreement between theoretical predictions 
and observations is so good it is almost disappointing, 
since the two agree over a broad range of M n \. While 
this result confirms the basic prediction of the structure 
formation models it would appear that it does not allow 
us to place useful constraints on cosmological models. 
This conclusion is too pessimistic, since nonlinear mass 
is also changed by varying f2 m and slope n s . For exam- 
ple, reducing Q m to 0.27 from 0.3 and n s to 0.96 from 
1 reduces nonlinear mass by a factor of 2 and increases 
theoretical bias predictions by 10% at L4. This brings 
the observations into a better agreement with theoretical 
predictions and improves the fit to x 2 ~ 7 at as = 0.88. 
The data thus favor slightly lower values of il m or n s 
than their canonical values of 0.3 and 1, respectively. We 
note that this analysis is strongly sensitive on the accu- 
racy of the bias as a function halo mass relation and we 
would find very different conclusions using expressions in 
[la. llSfl over the more recent ones in [Hj . 

These heuristic arguments are formalized in the next 
section, where we incorporate the bias constraints into 
the parameter estimation procedure. It is clear from this 
discussion that the bias constraints depend on nonlinear 
mass, which in turn depends on several cosmological pa- 
rameters such as as, Q m and n s , so the best strategy is 
to perform a full analysis over the parameter space of 
interest. 

However, it is worth highlighting where the strongest 
constraints are coming from and explore if the bias deter- 
mination of flux limited sample would improve the cos- 
mological constraints. One can see from figure|21that for 
LA in the range as = 0.6 — 0.9 all models give essentially 
the same value of as(LA) ~ 0.75, which is lower than the 
observed value of 0.85. Similarly, we can take the predic- 
tions for bias as a function of luminosity and weight it 
by galaxy numbers (given in table 1) to obtain the bias 
prediction for the flux limited sample. This has the ad- 
vantage that it can be related to the observed value which 
has very small errors, ct 8 (Mo.l,. = -20.8) = 0.88 ± 0.02. 
However, the predictions are almost independent of as'- 
we find 0.77 at as — 0.7 and 0.73 at as = 1 for fi m = 0.3 
and n s = 1. The fact that the predictions are lower 
than observed value again argues that these two param- 
eters (or shape parameter T which depends on Hubble 
parameter h as well) must be lower. Therefore, using 
the flux limited sample amplitude one cannot determine 
the bias and as despite a relatively small error on the 
observationally determined amplitude of galaxy cluster- 
ing. The reason for that is the degeneracy in the way 
bias changes with amplitude of fluctuations: reducing as 
by 10% reduces M n \ by a factor of 2 and increases bias 
predictions around M = M n \ by 10%, so the product 
bas, which determines the galaxy clustering amplitude, 
remains unchanged. We find that by using the full range 
of luminosity breaks this degeneracy, because the bias 
is very slowly changing with nonlinear mass on one end 
and very strongly changing with it on the other end of 
luminosity range. 
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D. Bias error budget 

We will include the following \ 2 component in the over- 
all likelihood evaluation. 



x 2 = 



E 



Qmodcl.i -K(b/K)iY 



(13) 



where 6 mo dci.i is the predicted bias for the i-th luminos- 
ity bin and Ob modcl , the corresponding error, (b/b*)i is 
the observed bias at the same luminosity and <Jb/b, A the 
corresponding error and cr sys = 0.03 accounts for system- 
atic uncertainties in the theoretical modeling of the bias 
and its variations with the model |19| . For a given model 
we first compute M n \ and then interpolate between the 
values shown in figure [21 to obtain 6 mo dci.i and 0h modcl , . 
Note that 6* is one of the parameters we are varying and 
is constrained both by the % 2 above and by the over- 
all amplitude of the galaxy power spectrum. For 6 mo dei,i 
we use the 2-parametcr fits as given in table 1, although 
using the 3-parameter fits would give almost identical 
results. 

Equation ^5] contains 3 contributions to the bias er- 
ror. First is the error on the theoretical bias predic- 
tion, en, , , . . This error is dominated by the uncertain- 

' "model, i •' 

ties in the conditional halo mass probability distribution 
p(M;L). Despite significant uncertainties in the proba- 
bility distributions (specially for the 18 parameter fits) 
the resulting values of <r 6 , , . are between 0.02-0.08 for 

O "model, l 

A/ n i ~ 10 13 /i _1 Mq. This is typically smaller than the er- 
rors on the observed bias o&/& i, which range from 0.054 
to 0.12. We assign a systematic uncertainty a sys = 0.03 
to all of the bins except the brightest one (L6), where we 
use cr sys = 0.1 to account for larger variations in model 
predictions, as well as larger systematic uncertainties due 
to the rapid variation of the bias with luminosity and red- 
shift. The systematic error is subdominant compared to 
the clustering or lensing error. Current bias constraints 
are mostly dominated by the observational uncertainties 
in the bias from the clustering analysis and not by the 
modeling uncertainties of either the bias or conditional 
halo mass probability distribution as determined by weak 
lensing. Note that systematic uncertanties in calibration 
and redshift distribution have already been included in 
the lensing analysis and corresponds to about 0.03 in 
bias. The current statistical error in the bias, averaging 
over all 6 bins in table 1, is 0.03, so the systematic error 
is at most equal to the statistical error. 

Much of the error budget in Objb, ,% is due to the sam- 
pling variance. We are treating the bias estimates as un- 
corrected between luminosity bins, assuming they come 
from independent volumes. In reality there is some over- 
lap in volume between neighboring luminosity bins and 
some of the same large scale structure contributes to two 
bins at the same time (see e.g. figure 3 in 9]). Be- 
cause of this our error estimation is conservative, since 
for overlapping regions sampling variance between lumi- 
nosity subsamples should be reduced. The reduction is 



however rather modest even for very large scales |l9j . 
Note that systematic uncertainties lead to some correla- 
tions between the errors, which we ignore in the present 
analysis since they are small. It would be straightforward 
to generalize upon this by computing the correlations be- 
tween the bootstrap samples. 



IV. COSMOLOGICAL PARAMETER 
DETERMINATION 



In this section we include the bias constraints in the 
parameter determination procedure to see if we can con- 
strain cosmological models better than without this in- 
formation. We combine the constraints from the SDSS 
power spectrum with CMB observations from WMAP 
[561 1571 l58| . We implement the Monte Carlo Markov 
Chain method Hj using CMBFAST version 4.5 1 \f^, 
outputting both the CMB spectra and the correspond- 
ing matter power spectra P(k). We evolve all the matter 
power spectra to a high k using CMBFAST and we do 
not employ any analytical approximations. In addition, 
we use linear to nonlinear mapping of the matter power 
spectrum using expressions given in [61] . 

Our implementation of the MCMC is the same as in 
62]. It is independent of that used in J5J, but we verified 
that the results for the case of WMAP+SDSS without 
bias agree. A typical run is based on 48 independent 
chains, contains 50,000-200,000 chain elements and re- 
quires 2-4 days of running on a 48 processor cluster in 
a serial mode of CMBFAST 2 . The success rate was of 
order 30-50%, correlation length (as defined in [5j) 10-30 
and the effective chain length of order 3,000-20,000. We 
use 23-39 chains and in terms of Gelman and Rubin R- 
statistics |^3 we find the chains are sufficiently converged 
and mixed, with R < 1.05, ie we are more conservative 
than the recommended value R < 1.2. 

Our pivot point is at k = 0.05/Mpc and we use the 
tensor normalization convention in which for the sim- 
plest inflationary models the tensor to scalar ratio is 
r = T/S = —8riT- Our most general parameter space 



p= (T,Wb,u} m ,m v ,Qx,1l,n a ,a a ,T/S,K), 



(14) 



1 available at cmbfast.org 

2 While CMBFAST is parallelized with MPI wc found that running 
it in parallel results in about a factor of 2 penalty on 8 processors 
(and more if more processors are used), mostly due to the fact 
that the highest fc-modes take the longest to run. The current 
implementation distributes fc-modes to individual processors, so 
the master node must wait for the slowest k mode to finish before 
the final assembly. For 48 processors the additional premium due 
to the required burn-in of each chain does not offset this penalty, 
so one is better off running CMBFAST serially. If significantly 
more processors were used the cost of burn-in would increase and 
one would be better off running CMBFAST in parallel in 8 node 
batches. 
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where r is the optical depth, u)b = Qbh 2 is proportional to 
the baryon to photon density ratio, w m — fl m h 2 is pro- 
portional to the matter to photon density ratio, m„ is 
the massive neutrino mass, fl\ is the dark energy density 
today and w its equation of state, 1Z is the amplitude of 
curvature perturbations at k — 0.05/Mpc, n s is the scalar 
slope at the same pivot and a s = dn s /d\nk is the run- 
ning of the slope, which we approximate as constant. We 
fix the tensor slope nr using T/S = —8tit- We do not al- 
low for non-flat models, since curvature is already tightly 
constrained by CMB and other constraints, which leads 
to Q K = 0.02 ± 0.02 for the simplest models [iij]. For 
the more general models, such as those with dark energy 
equation of state, relaxing this assumption can lead to a 
significant expansion of errors. We are therefore testing 
a particular class of models with K — and not present- 
ing model independent constraints on equation of state. 
We follow the WMAP team in imposing a r < 0.3 con- 
straint. Upcoming polarization data from WMAP will 
allow a verification of this prior. From this basic set of 
parameters we can obtain constraints on several other 
parameters, such as the baryon and matter densities Q& 
and Q m , Hubble parameter h and amplitude of fluctua- 
tions eg. Since we do now allow for curvature one has 
Oa = 1 — fJm an d we use fl m in table 2. In fact, our pri- 
mary parameter is S , the angular scale of the acoustic 
horizon, which is tightly constrained by the CMB. Simi- 
larly, although we use 1Z as the primary parameter in the 
MCMC we present the amplitude in terms of the more 
familiar erg. 



The basic result for two different MCMC runs are given 
in table 2 for SDSS combined with WMAP. For most 
of the parameters we quote the median value (50%), 
[15.84%,84.16%] interval (±lcr), and [2.3%,97.7%] inter- 
val (±2cr). These are calculated from the cumulative one- 
point distributions of MCMC values for each parameter 
and do not depend on the Gaussian assumption. For 
the parameters without a detection we only quote a 95% 
confidence limit. All of the restricted parameter space 
fits are acceptable based on x 2 values, starting from the 
6-parameter model with no tensors, running or neutrino 
mass. Introducing additional parameters does not im- 
prove the fit significantly. However, we wish to determine 
the amplitude of fluctuations in an as model independent 
way as possible and for this reason we explore the most 
general parameter space possible. 



Below we discuss the results from this table in more de- 
tail. Our standard model has 6 cosmological parameters: 
t, uji,,ui m , fl\ = 1 — Cl m ,H, n s , plus "nuisance parame- 
ter" bias 6* . For the case without bias our results are in 
a broad agreement with those in J5|, although a slightly 
different treatment (modification of lowest multipoles in 
WMAP and inclusion of r < 0.3 constraint) does lead to 
small changes in the best fit parameters and their errors. 
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FIG. 4: 68% (inner, blue) and 95% (outer, red) contours in 
(as,b*) plane using SDSS+WMAP+bias measurements. The 
two parameters are strongly correlated because only their 
product is determined from SDSS clustering analysis. The 
additional bias constraint helps reduce the degeneracy. 



A. erg and bias 

Figure 0] shows the 68% and 95% contours in the 
(crg,6*) plane. The two parameters are strongly corre- 
lated because the SDSS power spectrum constrains their 
product to be b^as = 0.87 ± 0.02. Both bias and eg 
are in a good agreement with the SDSS+WMAP analy- 
sis without bias constraints, which gives for the basic 6- 
parameter model 6* = 0.96±0.08 and a s = 0.92±0.09 @], 
but which in the presence of massive neutrinos changes 
to b* = 1-06 ± 0.10 and a 8 = 0.82 ± 0.09. We find 
as = 0.88 ± 0.06 and 6* = 0.99 ± 0.07 in the presence 
of neutrinos and running with bias constraints included. 
There remains a significant degeneracy between as and 
optical depth r, as shown in figure |SJ The upcoming 
WMAP polarization analysis may help improve this de- 
generacy. 

Let us now compare these constraints to other meth- 
ods to determine bias and as- The closest analysis to 
ours is that of the WMAP+2dF bispectrum using k ~ 
(0.2 — 0.3)/i/Mpc modes. Our a s is in a good agreement 
with the WMAP+2dF analysis with the bias constraint 
from the bispectrum, which gives a$ = 0.84 ± 0.04. The 
2dF bispectrum analysis has been performed on scales 
smaller than the scale of the power spectrum analysis 
(k < 0.2/i/Mpc), so many of subsequent analysis papers 
chose not to adopt this constraint. The fact that a com- 
pletely independent approach presented here finds the 
same result is therefore encouraging and suggests that 
the systematics are not dominating the statistical errors 
in either approach. 

There are several other methods of as determination, 
such as cluster abundance, weak lensing and Sunyaev- 
Zeldovich effect. Cluster abundance estimations of erg 
range from 0.6 to 1, often with very small errors (see a 
recent overview of the current situation in J5J). Many of 
these are reported for Q m = 0.3, so if the actual value is 
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Table 2: median value, la and 2cr constraints on cosmological parameters combining CMB, SDSS power spectrum shape and SDSS bias 

information. The columns compare different theoretical priors. The parameters for 7 parameter models are 

r, u)i,,u) m , Q\ = 1 — Q m ,'R.,n s , plus "nuisance parameter" bias 6». 



10 3 ^ 6 



6, s 



<T,x 



T/S 



7par 



23.4 



+1.2+2.5 
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+0.027+0.053 
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n Q07+0. 037+0. 071 
U.»O(_ .031-0 .055 



n lo-i +0.068+0.110 
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n QO/i+0. 070+0. 129 
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1.02 
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FIG. 5: 68% (inner, blue) and 95% (outer, red) contours 
in (<J8,t) plane using SDSS+WMAP+bias measurements. 
There is a correlation between the two, so a better determi- 
nation of optical depth r from polarization data would help 
improve the constraints. 



somewhat lower the required value for erg increases. The 
main difficulty is in calibrating the mass-temperature re- 
lation, which cannot be done with simulations, because 
these still lack some of the physics of cluster formation 
such as cooling, feedback, conduction etc. Direct cali- 
bration from the mass and temperature measurements 
on individual clusters is more promising, but is limited 
by statistical and systematic errors. With this method 
the results are particularly sensitive to calibration errors 



at various steps of the analysis, so the challenge for the 
future will be to control them at the required level. 

Weak lensing observations also have a similar spread 
in reported values of eg (between 0.7 and 1, see overview 
in |5j)- The assigned statistical errors are larger, so there 
may not be much conflict among different observations. 
In addition, with this method there are also systematic 
calibration effects at the 10-20% level in erg. Some of 
these are discussed in in the context of present analysis 
in §111. B, but a similar discussion applies to other weak 
lensing analyses as well. These are often not included in 
the error budget. As discussed above, these arise both 
for shear calibration from ellipticity measurements and 
for the redshift distribution of background galaxies. 

Finally, CBI [H HI and BIMA H3 measurements of 
the CMB at high I find excess power, which can be in- 
terpreted as a Sunyaev-Zeldovich signal. If so this would 
require a fairly high normalization, with estimates of erg 
ranging from 0.98±g ^ HU to 1.04±0.12(stat) ±0.1(sys) 
|68J | , where both errors are 2-sigma. Some of the differ- 
ence between the two estimates is due to the drop of the 
CBI amplitude in the latest analysis, while the rest is 
due to the differences in the modeling of the signal from 
either simulations or analytic models. 

In summary, our value of erg is consistent with most 
recently reported values. It is at the lower end of what is 
required to explain the CBI/BIMA excess power in terms 
of the SZ effect and at the upper end of some of the weak 
lensing and cluster abundance measurements. Each one 
of these may have additional systematic errors that could 
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bring the results into a better agreement. Our values are 
in excellent agreement with the WMAP+2dF analysis. 



B. Neutrino mass 

Galaxy surveys are important as tracers of neutrino 
mass, since neutrinos have a considerable effect on the 
matter power spectrum. At the time of decoupling such 
neutrinos are still relativistic, but become nonrelativistic 
later in the evolution of the universe if their mass is suffi- 
ciently high. Neutrinos free-stream out of their potential 
wells, erasing their own perturbations on scales smaller 
than the so-called free streaming length, defined as the 
distance at which a neutrino of a given rms velocity Uth 
can still escape against gravity. The velocity is c when 
neutrinos are relativistic and drops as 1/a afterwords be- 
cause of momentum conservation, so 



k R T 



Uth 



50(H-z)(m„/eV) _1 km/s. 



(15) 



Since the comoving Hubble time is proportional to tjj ~ 
[(l + z)il m ] _1 / 2 if^~ 1 the product of the two gives an esti- 
mate of the free-streaming length. The resulting comov- 
ing free-streaming wavevector is (for one massive family) 



k f „ = 0A(n m h 2 ) 1/2 (l + zr^^Mpc" 1 . (16) 

It should be evaluated at redshift when neutrinos become 
non-relativistic, since the dominant contribution comes 
from when neutrinos are relativistic. 

For a given wavevector k neutrino perturbations are 
suppressed while k > fe/ s . After that they can grow again 
and may even catch up with the matter perturbations. 
When neutrinos are dynamically important the neutrino 
damping also affects the matter fluctuations, decreasing 
their amplitude on scales below the free streaming length. 
One can see from equation^jjthat the scale is fairly large 
for the neutrino masses of interest, fcf s ~ O.lMpc -1 at 
z = for m v ~ leV. Below this suppression scale the 
power spectrum shape is the same as in regular CDM 
models, so on small scales the only consequence is the 
suppression of the amplitude (see figure HJJ). We can 
thus adopt the halo bias predictions from CDM mod- 
els and apply them to massive neutrino models as well. 
We should note that while qualitatively the effects are 
similar for 1 or 3 massive neutrino families, they differ in 
detail (figure |5J, so the constraints are not directly com- 
parable and one must do a separate analysis in the two 
cases. We mostly focus on 3 degenerate neutrino families 
here, but also present MCMC results for 3 masslcss+1 
massive family below. 

While it is commonly believed that massive neutri- 
nos have a minor effect on the CMB, this is actually 
not entirely the case for the masses of interest below 
2eV. This is because neutrinos with such a low mass are 
still relativistic when they enter the horizon for scales 
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FIG. 6: Top panel shows the change in CMB spectrum Ci 
for several neutrino masses relative to zero mass. The masses 
are m v — 0.15eV (solid, black), 0.3eV (dotted, red) and 0.6eV 
(long dashed, blue), all with 3 neutrino families of equal mass. 
Also shown (short dashed) is the case of 3 massless + 1 mas- 
sive neutrino family with m, = 0.9eV. Bottom shows the 
ratio of matter power spectra for the same models. We see 
that while increasing neutrino mass increases the CMB spec- 
trum it decreases the matter power spectrum. For the same 
total mass the 3+1 model is more non-relativistic at recom- 
bination, has a smaller effect on the CMB spectrum relative 
to 3 families of equal mass and the corresponding mass limits 
are weaker. 



around k = 0.1/i/Mpc and are either relativistic or quasi- 
relativistic at the time of recombination, z ~ 1100 (equa- 
tion I15J I . As a result neutrinos cannot be treated as a 
nonrelativistic component with regard to the CMB. Fig- 
ure El shows how much the CMB spectrum changes for 
various neutrino masses relative to the zero mass case, 
keeping O m = fl c d m + ^6 + &v constant (as well as the 
other cosmological parameters). One can see that for 
m„ = 0.3eV massive neutrinos increase the spectrum by 
6% at I — 200, well above the errors (neutrinos are also 
not degenerate with respect to the CMB if we compare 
them to a fixed fi c dm + ^& instead). In addition, massive 
neutrinos increase the CMB spectrum but suppress the 
power spectrum (figure[Sl , which enhances the sensitivity 
when the two tracers are combined. 

We note that our bias constraints significantly improve 
upon neutrino mass limits. In the absence of biasing 
constraints the neutrino mass limit from WMAP+SDSS 
is m v < 0.6eV if their masses are nearly degenerate J5[- 
Biasing constraints improve significantly upon this. We 
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find 



DISCUSSION AND CONCLUSIONS 



m v < 0.18eV(95%) (3 families, without running), (17) 

at 95% for a single component if we assume no run- 
ning, as was done in all of the work to date. Our con- 
straints improve upon WMAP+2dF constraints, where 
m v < 0.23eV was found by combining WMAP and 2dF 
with the bias determination from the bispectrum analysis 

However, this result was based on the assumption of no 
running. This assumption was present in all of the work 
so far, including [64| where it was argued that there is 
a weak evidence of running. A running spectral index 
changes the shape of the power spectrum as the massive 
neutrinos do, so including the running in the parame- 
ter estimation can significantly expand the limits. Naive 
expectations are that a negative running (which seems 
to be preferred by the data) suppresses the power on 
small scales just as a massive neutrino and so would lead 
to a tighter constraint on neutrino mass, but MCMC 
analysis does not confirm this and the constraints are 
weakened. With biasing constraints and running our con- 
straint changes to 

m„ < 0.24eV(95%) (3 families, with running). (18) 

All of the mass limits presented here are based on 3 
degenerate massive neutrino families. If one assumes a 
model with 3 massless families and 1 massive family (a 
sterile neutrino model), as motivated by LSND J6Jj, then 
the mass limits on the sum change, since both the CMB 
and and the transfer function change. One finds the lim- 
its are significantly weakened: in the WMAP+2dF anal- 
ysis without bias the limit is 1.4eV (70|. We find the 
same 

m„ < 1.37eV(95%) (3 + 1 families, no running). (19) 

The reason for the relatively weak constraint is that this 
case is much more degenerate with J7 m than the case of 
3 degenerate massive neutrino families. From the LSND 
experiment the allowed regions are four islands with the 
lowest mass m v — 0.9eV and the next lowest 1.4eV |71| . 
We see that the windows are rapidly closing with cosmo- 
logical constraints, but the case is not yet air tight. 

There were recent claims that the neutrino mass may 
have already been detected from cosmological observa- 
tions |72( . These claimed detections are inconsistent with 
our WMAP+SDSS or with WMAP+2dF constraints and 
are based primarily on the cluster abundance analysis of 
|73| . which seems to prefer a low value for the amplitude, 
erg ~ 0.7. As discussed above, cluster abundance esti- 
mates of as range from 0.6 to 1 and are likely to be dom- 
inated by systematics often not included in the quoted 
errors, such as mass-temperature calibration. Increas- 
ing the error on this method to account for systematics 
removes the evidence for neutrino mass. 



In this paper we presented a detailed comparison be- 
tween observations and theoretical predictions of one of 
the fundamental predictions of structure formation mod- 
els, that of linear bias as a function of halo mass. In 
doing so we combine two separate observational analyses 
of SDSS data, the galaxy clustering analysis and a weak 
lensing analysis, both as a function of galaxy luminos- 
ity. The former gives us the relative bias as a function 
of luminosity, while the latter connects the galaxies of 
a given luminosity class to their dark matter halo mass 
distribution. We find a remarkable agreement between 
the observations and theoretical predictions of bias over 
a range of halo masses from 10 n /i _1 Mq to 10 13 /i _1 Mq. 
This success should be viewed as an important new con- 
firmation of the current large scale structure paradigm in 
predicting the properties of the universe we live in. 

The second goal of this work is to provide a determi- 
nation of the bias for SDSS galaxies, which can be used 
to improve the cosmological parameter estimation. For 
any given model we can determine bias from the halo 
mass-bias relation and from the amplitude of galaxy clus- 
tering. The two must agree, which requires the bias of 
Afov = —20.8 galaxies to be very close to unity. As a 
result we can place constraints on the amplitude of fluc- 
tuations, as — 0.88 ± 0.06, as well as on the other other 
cosmological parameters. Our results are in an excellent 
agreement with the WMAP+2dFGRS analysis of HI]. In 
particular, we find no evidence for any systematic differ- 
ences between the SDSS and 2dF power spectra in either 
amplitude or shape. 

The systematic errors from the galaxy clustering data 
have been thoroughly examined in 9], but some open 
question remain to be addressed. One of them is the cor- 
rection for nonlinear effects in the power spectrum anal- 
ysis. These can affect both the conversion from redshift 
space to real space and from the real space power spec- 
trum to the linear power spectrum. The current analysis 
in [3 is based on the power spectrum with k < 0.2/i/Mpc, 
but this cutoff is somewhat arbitrary and should be jus- 
tified within a more realistic model, which will provide 
an estimate of the systematic error as a function of k. 
Currently the nonlinear corrections are based on the non- 
linear evolution model of [6l|. However, galaxies are not 
a perfect tracer of dark matter and the nonlinear cor- 
rection for galaxies could be different from that of dark 
matter. For example, in the context of halo models non- 
linear effects are entirely due to the correlations within 
the halos. If galaxies populate larger (smaller) halos than 
the dark matter then the nonlinear corrections will be 
larger (smaller). The halo model is not sufficiently accu- 
rate to address these questions in detail and simulations 
are needed instead. To put things in perspective, the 
overall galaxy clustering amplitude 6*178 from SDSS us- 
ing k < 0.2/i/Mpc data points has an error of 1.5%, while 
the nonlinear correction at k = 0.2/i/Mpc is around 10%. 
In this situation it does not take much for the systematic 
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error to dominate over the statistical error. However, 
much of the error will be on the overall amplitude and 
this is still limited by the error on our bias determina- 
tion, which is around 7%. Nonlinear effects are likely 
to be even more important for the luminosity dependent 
analysis of galaxy clustering, which was used in this pa- 
per as a basis for bias determination, but statistical errors 
in this analysis are larger and systematics may not dom- 
inate the results. It is clear that these issues have to be 
revisited if one is to believe the cosmological implications 
from these results. 

We have argued that the method presented here is in 
many ways more robust than some of the other meth- 
ods to determine the bias from observations. Still, there 
remain possible systematic errors in the present analy- 
sis that need to be explored further. Two uncertainties 
mentioned in the present analysis are the calibration of 
the weak lensing signal and the accuracy of the bias-halo 
mass relation. We have argued that the weak lensing 
method is robust in that even a 20% calibration error 
leads to only 0.03 error in bias. This does not dominate 
relative to the statistical error and we have included it in 
the analysis. Similarly, the bias-halo mass relation has 
been calibrated to an accuracy of 0.03 using a suite of 
large simulations covering some of the parameter space 
of interest [n| , but larger simulations and more extensive 
grid of parameter space is needed to improve this to an 
accuracy of 0.01. 

The current paper should be viewed as a first applica- 
tion of this method of bias determination, rather than the 
last word on it. There are many ways the current analysis 
could be improved. The most important among these is 
reducing the error on the clustering amplitude as a func- 
tion of luminosity, specially for low luminosity galaxies. 
As we argued most of the leverage for bias determination 
comes from the low luminosity galaxies, which reside in 
low mass halos, and for which the bias is only weakly de- 
pendent on the nonlinear mass (it is also relatively insen- 
sitive to errors in the lensing analysis). A better analysis 
optimized to reduce the sampling variance errors should 
reduce the errors considerably. Galaxies with absolute 
luminosities in the range [—18,-20] seem particularly 
promising, since they have a reliable lensing detection 
(figure^ and a weak bias dependence on nonlinear mass 
(figure |2|, unless the nonlinear mass is very low. Their 
clustering amplitude is currently very poorly determined 
compared to the overall sample (Table 1), which could 
be improved dramatically with a more careful analysis. 
In addition, the systematic errors in the lensing analysis 
could be reduced further. All of these aspects can be 
improved in the near future. This could lead to signifi- 
cant improvements on the cosmological parameters such 
as neutrino mass or equation of state. 

The present paper is only the first in several possible 
attempts to estimate the large scale bias in SDSS. An on- 
going project closest to our approach is to use the galaxy 
auto-correlation function on small scales to constrain the 
halo occupation probability distribution. A bispectrum 



analysis of SDSS galaxies is also in progress and should 
yield results which are statistically comparable to the 
present analysis. A weak lensing analysis on large scales 
can also determine the bias, although this would require 
larger survey areas than currently available and a tight 
control of all possible systematics. Current efforts are 
limited to small scales and their statistical power remains 
weak 12]. Finally, with better modeling of redshift space 
distortions the constraints on j3 may also improve beyond 
the current limits, which at the moment remain weak |£|. 
Combining and comparing these with the current analy- 
sis will provide additional checks of systematics in these 
methods. 

The method presented here can be applied to other 
samples of galaxies, such as those selected by color, spec- 
tral type or stellar mass. Of particular interest would 
be to apply it to the higher redshift galaxies, such as the 
Luminous Red Galaxies, which are very numerous in sur- 
veys such as SDSS and whose photometric redshifts are 
relatively accurate and well understood. Their cluster- 
ing amplitude on large scales can be determined with a 
high accuracy, close to 1%, in several redshift bins up to 
Z = 0.7. Without a model for bias the amplitude does not 
give useful information. If one could determine their halo 
mass distribution function with lensing that would allow 
one to predict the bias and thus extract the growth factor 
as a function of redshift. While the absolute calibrations 
of bias are still difficult at a 1% level, the relative cali- 
bration as a function of redshift may be more promising. 
This may be one of the most promising methods to place 
constraints on the dark energy equation of state and its 
evolution. 
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